Modeling Textual Cohesion for Event Extraction

نویسندگان

  • Ruihong Huang
  • Ellen Riloff
چکیده

Event extraction systems typically locate the role fillers for an event by analyzing sentences in isolation and identifying each role filler independently of the others. We argue that more accurate event extraction requires a view of the larger context to decide whether an entity is related to a relevant event. We propose a bottom-up approach to event extraction that initially identifies candidate role fillers independently and then uses that information as well as discourse properties to model textual cohesion. The novel component of the architecture is a sequentially structured sentence classifier that identifies event-related story contexts. The sentence classifier uses lexical associations and discourse relations across sentences, as well as domain-specific distributions of candidate role fillers within and across sentences. This approach yields state-of-the-art performance on the MUC-4 data set, achieving substantially higher precision than previous systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Study of the Effectiveness of Textual Cohesion of Teaching Materials on Iranian Intermediate EFL Learners’ Reading Comprehension

The present investigation was an attempt to study the effect of difference in textualcohesion of different teaching materials on Iranian intermediate EFL learners' readingcomprehension. To that end, a QPT test was administered to 105 EFL students learningEnglish language in institutes. Based on QPT test direction individuals who get 31+ ingrammar and vocabulary, 8+ in re...

متن کامل

Text Mining Support in Semantic Annotation and Indexing of Multimedia Data

This short paper is describing a demonstrator that is complementing the paper “Towards Cross-Media Feature Extraction” in these proceedings. The demo is exemplifying the use of textual resources, out of which semantic information can be extracted, for supporting the semantic annotation and indexing of associated video material in the soccer domain. Entities and events extracted from textual dat...

متن کامل

Re-Implementing Apache Thrift using Model-Driven Engineering Technologies: An Experience Report

In this paper we investigate how contemporary model-driven engineering technologies such as Xtext, EMF and Epsilon compare against mainstream techniques and tools (C++, flex and Bison) for the development of a complex textual modelling language and family of supporting code generators (Apache Thrift). Our preliminary results indicate that the MDE-based implementation delivers significant benefi...

متن کامل

05. Linguistic analysis of impairment data. 05.03. Textual analysis of impaired speech samples: 05.03.02. Textual cohesion at morphosyntactic level: agrammatism and paragrammatism. Indices of syntactic complexity. Textual cohesion: morphology and syntaxis in flexive languages

Although we define textual cohesion as a pragmatic characteristic, its dependence on grammar is obvious. Ángel Herrero defines it as follows: "A set of linguistic mechanisms used by a text to ensure the explicit cohesion of its parts. Cohesion enables the linguistic interpretation of an element in the text (a morpheme, a word, a construction) to take place thanks to other elements in the same t...

متن کامل

Towards a Textual Cohesion Model that Predicts Self-Explanations Inference Generation as a Function of Text Structure and Readers’ Knowledge Levels

The Interactive Strategy Trainer for Active Reading and Thinking (iSTART) is an intelligent tutoring system that provides students with automated training on reading strategies. In particular, iSTART trains students to selfexplain target sentences so as to integrate encoded information into a coherent mental representation. The goal of this study was to investigate the relation between text str...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012